Add an example using Optuna and Transformers #304

ParagEkbote · 2025-06-03T18:58:08Z

What does this PR do?

In this end-to-end tutorial, we use the Optuna library to perform hyperparameter optimization (HPO) on a BERT model with the IMDB dataset.

First, we load and preprocess the dataset and define the model to tune. Next, we define the evaluation metrics and pass them to the Trainer class together with a search space covering the learning rate, weight decay, and batch size. Finally, we visualize the results.
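For readers skimming the thread, the search space described above could look roughly like the sketch below. The function name, the ranges, and the batch-size choices are illustrative assumptions and are not taken from the notebook; only the three tuned hyperparameters come from the description.

```python
def optuna_hp_space(trial):
    """Return one sampled set of hyperparameters for a single HPO trial.

    `trial` is expected to behave like an `optuna.trial.Trial`.
    All ranges below are assumed values for illustration only.
    """
    return {
        # Learning rate is sampled on a log scale (assumed range).
        "learning_rate": trial.suggest_float("learning_rate", 1e-6, 1e-4, log=True),
        # Weight decay is sampled uniformly (assumed range).
        "weight_decay": trial.suggest_float("weight_decay", 0.0, 0.1),
        # Batch size is picked from a small fixed set (assumed choices).
        "per_device_train_batch_size": trial.suggest_categorical(
            "per_device_train_batch_size", [8, 16, 32]
        ),
    }
```

A function like this is typically passed as `hp_space` to `Trainer.hyperparameter_search(backend="optuna", ...)`, with the `Trainer` built from a `model_init` callable so that each trial starts from a fresh model.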

Please let me know if any modifications are required and I will make the necessary changes.

Who can review?

@stevhliu.

review-notebook-app · 2025-06-03T18:58:13Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

stevhliu · 2025-06-04T17:07:06Z

Thanks for your work! However, I don't think it's all that different from the current hyperparameter search docs in Transformers, except that it's a more complete example.

@merveenoyan @sergiopaniego what do you think?

ParagEkbote · 2025-06-04T18:48:57Z

Just for the record, I had originally wanted to add support for the Transformers library to the optuna-integration package. But since the Transformers library already provides backend support, I contributed a starter example to their repo.

This PR builds on that example and provides a more hands-on approach for users to understand how to apply HPO to transformer models 🙂

merveenoyan · 2025-06-05T08:49:52Z

@ParagEkbote The cookbook mostly contains end-to-end applied AI recipes where library integrations shine 💫 rather than minimal examples. It would be great to make this a more applied ML type of recipe.


ParagEkbote · 2025-06-11T16:51:54Z

I have now added the following improvements to the recipe to make it more applied:

Successful trials are now saved to SQLite using RDBStorage.
Observability is now available through Weights & Biases, which tracks and analyzes the HPO trials.
The final training run now uses the optimized hyperparameters, and the resulting model is pushed to the HF Hub.
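As a rough sketch of the first point, persisting trials with RDBStorage might look like the following. The helper names, the database file name, and the study name are hypothetical and not taken from the notebook; `optuna` is imported lazily so the URL helper can be used on its own.

```python
from pathlib import Path


def sqlite_url(db_path):
    """Build a SQLAlchemy-style SQLite URL for a local database file."""
    return f"sqlite:///{Path(db_path).as_posix()}"


def open_study(db_path="optuna_trials.db", study_name="bert-imdb-hpo"):
    """Create (or reopen) a study whose trials are stored in SQLite.

    The default file and study names are placeholders for illustration.
    """
    import optuna  # third-party dependency, imported only when needed

    storage = optuna.storages.RDBStorage(url=sqlite_url(db_path))
    # load_if_exists=True resumes an earlier study instead of raising an error.
    return optuna.create_study(
        study_name=study_name,
        storage=storage,
        direction="maximize",
        load_if_exists=True,
    )
```

Because the trials live in a database file, an interrupted search can be resumed later by calling `open_study` again with the same file and study name.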

Could you please review the changes?

cc: @stevhliu, @merveenoyan

ParagEkbote added 6 commits June 3, 2025 09:32

add the initial example.

2659b54

update the example.

7c6cf76

update example

6e19b73

update the tutorial.

935ac90

update the tutorial subheadings.

76e8684

update the tutorial.

fdd9765

update the example.

0cf0f58

ParagEkbote added 2 commits June 11, 2025 14:26

update the tutorial with observability, storage for the trials and push to hub to make it more applied.

b91ad3c

update the example.

0f0fa1f
